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A method of cloning proteins in yeast and cellulase from 
Humicola insolens. 

FIEIiD OF INVENTION 

5 The present invention relates to a method of screening for DNA 
sequences coding for proteins of interest, as well as to a 
process for producing such proteins of interest. 

BACKGROUND OF THE INVENTION 

10 

The advent of recombinant DNA techniques has made it possible 
to select single protein components with interesting properties 
and produce them on a large scale. This represents an 
improvement over the previously employed production process 

15 using microorganisms isolated from nature and producing a 
mixture of proteins which would either be used as such or 
separated after the production step. However, the conventional 
cloning techniques have the drawback that each protein 
component has to be purified and characterized by its (partial) 

20 amino acid sequence before it is possible to prepare synthetic 
oligonucleotide probes for hybridization experiments. Since 
this is a rather time-consuming process, the cloning of novel 
proteins might be considerably expedited by using a screening 
method involving selecting clones expressing, a desired protein 

25 activity. 

Such a screening method has previously been devised for the 
cloning of prokaryotic gene products in Bacillus , cf. US 
4,469,791; P. Cornells et al*, Mol . Gen. Genet. 18 6 , 1982, pp. 
30 507-511; I. Palva, Gene 19, 1982, pp. 81-87; S.A. Ortlepp, Gene 
23., 1983, pp. 267-276; H. Yamazaki et al., J. Bacterid . 156 , 

1983, pp. 327-337; N. Tsukagoshi et al . , Mol, Gen. Genet. 193 , 
^ 1984, pp. 58-63; M. Sibakov and I. Palva, Eur. J. Biochem. 145 , 

1984, pp. 567-572; and J.R. Mielenz, Proc. Natl. Acad. Sci. USA 
35 80, 1983, pp. 5975-5979. A screening method based on expression 

cloning of eukaryotic genes in mammalian cells has been 
described, e.g. in D.P. Gearing et al . , The EMBO J. 8., 1989, 
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pp. 3667—3676; N. Harada et al., Proc. Nat:l. Acad. Sci> USA 87 ^ 
1990, pp. 857-861; and R. Fukunaga et al. , Cell 61 , 1990, pp. 
341-350. 

5 SUMMARY OF THE INVENTION 

It has now been found possible to screen for yeast clones 
expressing protein activities of interest with a view to 
isolating DNA coding for single protein components. 

10 

Accordingly, the present invention relates to a method of 
screening for a DNA sequence coding for a protein of interest, 
the method comprising 

15 (a) cloning, in suitable vectors, a DNA library from an 
organism suspected of producing one or more proteins of 
interest, 

(b) transfonaing suitable yeast host cells with said vectors, 

20 

(c) culturing the host cells under suitable conditions to 
express any protein of interest encoded by a clone in the DNA 
library, and 

25 (d) screening for positive clones by determining any activity 
of a protein expressed in step (c) . 

As indicated above, expression cloning of prolcaryotic genes in 
Bacillus has previously been described. The prokaryotic systems 

30 devised for expression cloning, however, are not operable for 
the cloning of eukaryotic genes which are generally difficult 
to express in Bacillus . While expression cloning of eukaryotic 
genes ±n mammalian cells has been described, it is more 
advantageous to use yeast as a host organism as it is possible 

35 to obtain a much higher trems formation frequency than with 
mammalian cells, and as yeast is far easier to cultivate. 
Furthermore, the yeast clones are stable whereas the mammalian 
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expression cloning system described in the references cited 
above is based on transient expression in COS cells. Unlike the 
mammalian system, the yeast system results in pure clones after 
^ the initial screening and, therefore, they need not be screened 

5 in pools and subpools as in the mammalian system. Apart from 
this conventional selection systems may be used to select yeast 
transf ormants . 

According to the present invention, it has surprisingly been 
10 found that yeast cells appear to be able to express 
heterologous genes extracellularly by means of heterologous 
secretion signals in amounts which are sufficient for screening 
purposes. Although expression cloning of certain proteins in 
yeast has been described previously (G.L. McKnight and B.L. 
15 McConaughy, Proc. Nat. Acad. Sci. USA 80 , 1983, pp. 4412-4416), 
it has not been generally useful as it is based on 
complementation of essential genes and therefore is dependent 
on yeast host strains which have been mutated to lack these 
essential genes. In the present screening method, no such 
20 requirement is necessary for the yeast host strain to be used 
in the method. Besides, the gene products of the previously 
described method are intracellular rather than extracellular 
as in the present method. 

25 The advantage presented by the present screening method is 
primarily that it requires no prior knowledge of the structure 
of the protein of interest. This means that the rate at which 
novel genes may be isolated and, consequently, novel products 
be developed may be greatly increased. Furthermore, the method 

30 permits screening for multiple protein activities and may even 
result in the isolation of several different genes coding for 
the same type of proteins. 

In another aspect, the present invention relates to a process 
35 for producing a protein of interest in a heterologous host 
cell, the process comprising transforming a suitable 
heterologous host cell with a DNA sequence coding for a protein 
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of interest, which DNA sequence has been isolated by the 
screening' method of the invention^ culturing the transformed 
cells under suitable conditions to express the protein, and 
recovering the expressed protein from the culture. 

5 

In a further aspect, the present invention relates to an enzyme 
which exhibits cellulase activity, and which has the following 
characteristics 

10 (a) the DNA sequence encoding the enzyme has been isolated from 
a DNA library of Humicola insolens . 

(b) said DNA sequence comprises at least one of the following- 
partial sequences 



15 



20 



25 



30 



(i) 



(* • « 



XXX) 



(iv) 



TGGCAGCAGT GTGGTGGCGT 
CCTGTGTGTC CGGTTACACG 
GTACAGCCAA TGC 
(SEQ ID#1) 

CAGCGCAGCC GACGACGTTA 
AGGGCAACAT CGACAACAA6 
CAACCACTCCG G 
(SEQ ID#2) 



TGGCTTCTCG 
TGCGTGTACT 



GGCTCTACGT 
TGAACGACTG 



CGGACAACAC 
GTCAGCCCCG 



AACAACGACC 
GCTGCCACTT 



CCAAGGCGAA 
GCPGAGTTCG 
CTTACTTCCT 



GTTCAAGTGG 
6CAAGGAGA6 
TCGCGACGTC 



TTGGCATCAA 
TATCCGGCTA 
GTCGATTCAA 



CCAGTCCTGC 
TGGGCAA6CA 
GCGCACATCA 



ATCGTGGCTT CA (SEQ ID#3) 



CTGAC6TGAA 



CGTGACCAAC 



AACAACTTGG 



CCGTAGCGAC 



CGAGAACAAG CTGTGTACCA GATGCATCA (SEQ ID#4) 



35 



(V) 



GGACGGTCCG GCACGAGCAC GGCCTGCGTC AGCACCCAGG 

TCGGCCTTCA GCGCGTCATT GGCGCGACCA ACTGGCTCAG 

GCAAAACGGC AAGGTTGGAC TGCTCGC6AC TTGCCGCGGC (SEQ 
ID#5) 
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(vi) GCCAAGTGGG TTTGCCAGCA GGCCATTGAG GGCATGCTGA 

ACCACCTCCA GGAGAATAGC GATGTCTGGA CAGGTGCGCT 
CTGGTGGGCG GGAGGCCCGT GGTGGGGTTG ACTATATCTA (SEQ 
ID#6) 

5 

(c) the enzyme comprises a cellulose-binding domain, and 

(d) the enzyme exhibits endocellulase activity in the presence 
of linear alkyl benzene sulfonate. 

10 

The enzyme of the invention may be isolated by the method of 
the invention - 

In the present context, the term "cellulose-binding domain" is 
15 intended to indicate an amino acid sequence capable of 
effecting binding of the enzyme to a cellulosic substrate. 
Cellulose-binding domains have been found to be important for 
the endoglucanase activity of cellulytic enzymes on substrates 
(cf - the discussion in PCT/DK9 1/00124) . The term "endocellulase 
20 activity" refers to the ability of the enzyme to degrade 
cellulose to glucose, cellobiose, triose and other cello- 
oligosaccharides, as determined by the formation of clearing 
zones in a carboxymethyl cellulose (CMC) gel under, the 
conditions specified below. Unlike the endocellulase described 
25 in PCT/DK91/00123) , the enzyme of the present invention shows 
. substantially unchanged stability in the presence of linear 
alkyl benzene sulfonates. This is an important advantage as 
linear alkyl benzene sulfonates are commonly used in detergent 
compositions. 

30 

DETAILED DISCLOStJRE OF THE INVENTION 

According to the invention, the DNA library is preferably a 
cDNA library prepared from the mRNA of an organism suspected 
35 of producing one or more proteins of interest. Although it may 
also be possible to screen genomic libraries in this manner, 
at least some potential yeast hosts may not be able to splice 
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eu]caryotic genomic DNA correctly^ and -therefore a positive 
result of the screening may more often be obtained by using 
cDNA instead. 

5 To ensure a more accurate result, it may be an advantage to 
STibject positive clones isolated in step (d) of the present - 
method to rescreening, reisolation and recloning. 

The organism suspected of producing one or more proteins of 
10 interest is typically a eukaryotic organism, in particular a 
fungus since fiingi are Jcnown to produce a large number of 
different proteins which makes the traditional process of 
isolating a gene coding for a particular protein product by 
initially purifying each protein separately particularly 
15 cumbersome. This maJces it particularly advantageous to screen 
fungal DNA libraries by the method of the invention because a 
large number of different protein activities (and DNAs coding 
for them) may be identified within a relatively short time-span 
using the same library- In this respect, screening of yeast 
20 colonies for different protein activities is far more efficient 
than screening of filamentous fungi as a large number (i.e. 
about 500-1000) of yeast colonies may be grown on each plate, 
compared to 10-50 filamentous fungi/plate. 

25 One type of industrially useful proteins currently obtained 
from fimgi is enzymes. Thus, yeast clones may be screened by 
the method of the invention for expression of one or more 
enzyme activities by means of appropriate assays. Examples of 
enzymes which may be identified by this method are 

30 carbohydrases, e.g. cellulytic enzymes such as endocellulases , 
cellobiohydrolases, p-glucanases or j3-glucosidases, 
hemicellulases or pectinolytic enzymes such as galactanases , 
galactosidases, mannanases, xylanases, pectinases, xylosidases, 
arabanases, rhamnogalacturonases or amylases; esterases, e.g. 

35 lipolytic enzymes such as lipases; proteases; oxidoreductases , * 
e.g. peroxidases, oxidases or laccases; or isomerases, e.g. 
glucose isomerase. 
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A wide range of indicator systems for the different types of 
enzymes may be used for the screening of yeast colonies on agar 
plates. For instance, endocellulases may be identified by 
clearing zones in carboxymethyl cellulose after staining with 
5 Congo Red; similar methods may be used to detect glucanases, 
xylanases and galactanases. Endoarabanases may be identified 
by blue zones obtained after dissolution of azurine-crosslinked 
araban. This principle is general and may be used to detect, 
e.g., mannanases, xylanases and cellulases. Pectinases 

10 (polygalacturonases and pectin lyases) may be identified by 
clearing zones in pectin after precipitation with quaternary 
ammonium ions. Amylases may be identified by clearing zones in 
starch after visualisation with iodine, a-galactosidases may 
be detected by the release of p-nitrophenol (yellow) from p- 

15 nitrophenol-a^galactopyranoside or by coupling released 
naphthole or naphthoic derivatives from, e.g. l-naphthole-a- 
galactopyranoside to azo dyes; similar methods may be used to 
detect )3-galactosidases, a- and )S-glycosidases, /3-xylosidase 
and j9-mannosidase , Numerous methods are available for the 

20 detection of proteases, e.g. clearing zones in casein after 
precipitation with trichloroacetic acid. Peroxidases and 
oxidases may be detected by the reaction of 4-aminoantipyrine 
with ESBT (N-ethyl-N-sulfobutyl-m-toluidine) in the presence 
of hydrogen peroxide (generating a purple colour) . Lipases may 

25 be detected by the formation of clearing zones in tributyrine 
emulsions. 

The yeast strain selected to be the host cell for the DNA 
library may be any yeast strain conventionally used for the 

30 cloning of heterologous DNA sequences. Thus, the yeast strain 
may suitably be selected from Saccharomyces sp. , such as 
Saccharomyces cerevisiae , Saccharomyces kluyveri . Saccharomyces 
uvarum or Schizosaccharomvces pombe . Hansenula sp. Pichia sp. , 
Yarrowia sp. such as Yarrowia lipolytica . or Kluweromvces sp. 

35 such as Kluweromyces lactis . 
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The vector used for cloning the DNA library may be any vector 
which may conveniently be subjected to recombinant DNA 
procedures. In each vector, the DNA sequence derived from the 
library should be operably connected to a suitable promoter 
5 sequence. The promoter may be any DNA sequence which shows 
transcriptional activity in the yeast cell. Examples of 
suitable promoters for use in " yeast host cells include 
promoters from yeast glycolytic genes (HitzemaA et al. , 
Biol. Chem. 255 . 1980, pp. 12073-12080; Alber and Kawasaki, is. 
10 Mol. AppI. Gen. 1, 1982, pp. 419-434) or alcohol dehydrogenase 
genes (Young et al., in Genetic Enginee ring of Microorganisms 
for Chemicals (Hollaender et al, eds.). Plenum Press, New York, 
1982) , or the TPIl (US 4, 599, 311) or ADH2-4C (Russell et al., 
NatTxre 304 . 1983, pp. 652-654) promoters. 



15 



20 



Each DNA library sequence may also be operably connected to a 
suitable terminator, such as the TPIl (Alber and Kawasaki, oe^ 
eit. ^ or ADH3 (McKnight et al. , op. cit. ) or yeast MFa 
terminators . 



The vector may further comprise a DNA sequence enabling the 
vector to replicate in yeast cell. An example of such a 
sequence is the yeast plasmid 2a* replication genes REP 1-3 and 
origin of replication. If the vector is a yeast/g^. coli shuttle 

25 vector, it will also include an origin of replication region 
which is functional in coli . The vector may also comprise 
a selectable marker, e.g. a gene the product of which 
complements a defect in the host cell such as TJRA3, or one 
which confers resistance to a drug, e.g. ampicillin, kanamycin, 

30 chloramphenicol, tetracyclin, etc., or the Schizosaccharomyces 
pombe TPI gene (described by P.R. Russell, Gene 40, 1985, pp. 
125-130) . 

The procedures used to ligate the DNA library sequences, the 
35 promoter and the terminator, respectively, and to introduce 
them into suitable vectors containing the information necessary 
for replication, are well known to persons skilled in the art 
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(cf . , for instance, Sainbrook et al.. Molecular Cloning: A 
Laboratory Manual . Cold Spring Harbor, New York, 1989), The 
transformation of yeast cells may for instance be effected by 
protoplast formation followed by transformation or by the LiAc 
5 method in a manner known per se . 

In the process of the invention of producing a protein of 
interest after the DNA coding for the protein has been isolated 
by the screening method described above, the heterologous host 
10 cell transformed with the isolated DNA sequence may be a strain 
of a filamentous fungus, e.g. fungi belonging to the groups 
Phycomycetes, Zygomycetes, Ascomycetes, Basidiomycetes or Fungi 
Imperfecti, including Hyphomycetes such as the genera 
Aspercrillus p Trlchoderma . Penicillium > Fusarium or Humicola . 

15 

The filamentous fungus host organism may conveniently be one 
which has previously been used as a host for producing 
recombinant proteins, -e.g. a strain of Aspergillus sp. , such 
as A_i. niaer , A. nidulans or A^. orvzae . The use of A^ oryzae in 
20 the production of recombinant proteins is extensively described 
in, e.g. EP 238 023. 

In particular when the host organism is A^ orvzae . a preferred 
promoter for use in the process of the present invention is the 
25 A^ oiryzae TAKA amylase promoter as it exhibits a strong 
transcriptional activity in A. oryzae . The sequence of the TAKA 
amylase promoter appears from EP 238 023. 

Termination and polyadenylation sequences may suitably be 
30 derived from the same sources as the promoter. 

The techniques used to transform a fungal host cell may 
suitably be as described in EP 238 023. 

35 The medium used to culture the transformed host cells may be 
any conventional medium suitable for growing Aspergillus cells. 
The mature protein secreted from the host cells may 
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conveniently be recovered from the culture medivim by well-known 
procedures including separating the cells from the medixm by 
centrifugation or filtration, and precipitating proteinaceous 
components of the medium by means of a saJLt such as ammonium 
5 sulphate, followed by chromatographic procedures such as ion 
exchange chromatography, affinity chromatography, or the like. - 

A preferred endocellulase enzyme according to the invention is 
cm enzyme, a crude extract (15 ^1) of which diluted witb one 

10 volume of 0.15% linear alkyl benzene sulfonate and added to a 
2% agarose gel containing 1% carboxymethyl cellulose in 50 mM 
sodium phosphate buffer, pH 7, mixed with one volume of 0.15% 
linear alkyl sulfonate fojnns a cliearing zone in said agarose 
gel after la hours of incubation, which clearing zone is equal 

15 to (less 3 mm) the clearing zone foimed in a similar 
carboxymethyl cellulose gel not containing any linear alkyl 
benzene sulfonate, provided that the concentration of enzyme 
in the extract is such that a clearing zone of at least 10 mm 
is formed in a carboxymethyl cellulose gel (with no linear 

20 all^l benzene sulfonate) under the conditions specified above^ 

The DNA sequence coding for the enzyme may for instance be 
isolated by screening a cDNA library of Humicola insolens, e.g 
strain DSM 1800, deposited on 1 October 1981 at the Deutsche 
Sammlung von Mikroorganismen in accordance with the provisions 
of the Budapest Treaty and selecting for clones expressing the 
appropriate enzyme activity (i.e. endocellulase activity as de- 
fined above) . The appropriate DNA sequence may then be isolated 
from the clone by standard procedures, e.g. as described in 
Example 1 . 

In a fuirther aspect, the invention relates to a detergent 
additive comprising the enzyme of the invention. The detergent ' 
additive may suitably be in the form of a non-dusting granula- 
35 te, stabilized liquid or protected enzyme. Non-dusting 
granulates may be produced e.g. according to US 4,106,991 and 
4,661,452 (both to Novo Industri A/S) and may optionally be 
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coated by methods known in the art. Liquid enzyme preparations 
may, for instance, be stabilized by adding a polyol such as 
propylene glycol, a sugar or sugar alcohol, lactic acid or 
boric acid according to established methods. Other enzyme 
5 stabilizers are well known in the art. Protected enzymes may 
be prepared according to the method disclosed in EP 238 216. 

It will be understood that the detergent additive may further 
include one or more other enzymes, such as a protease, lipase, 
10 peroxidase or amylase, conventionally included in detergent 
additives . 

- In a still further aspect, the present invention relates to a 
detergent composition comprising the enzyme of the invention. 
15 The detergent composition of the invention may be in any 
convenient form, e.g. as powder, granules or liquid. A liquid 
detergent may be aqueous, typically containing up to 90% water 
and 0-20% organic solvent. 

20 The detergent composition comprises a surfactant which may be 
anionic, non-ionic, cationic, amphoteric or a mixture of these 
types. The detergent will usually contain 0-50% anionic 
surfactant such as linear alkyl benzene sulphonate (LAS), 
alpha-olefin sulphonate (AOS), alkyl sulphate (AS), alcohol 

25 ethoxy sulphate (AES) or soap. It may also contain 0-40% non- 
ionic surfactant such as nonyl phenol ethoxylate or alcohol 
ethoxylate- Furthermore, it may contain a polyhydroxy fatty 
acid amide surfactant (e.g. as described in WO 92/06154) . 

30 The detergent composition may additionally comprise one or more 
other enzymes, such as an amylase, lipase, peroxidase, oxidase 
or protease. 

The pH (measured in aqueous detergent solution) will usually 
35 be neutral or alkaline, e.g. 7-11. The detergent may contain 
1-40% of a detergent builder such as zeolite, phosphate, 
phosphonate, citrate, NTA, EDTA or DTPA, alkenyl succinic 
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anhydride, or silicate, or it may be unbuilt (i.e. essentially 
free from a detergent builder) . It may also contain other 
conventional detergent ingredients, e.g. fabric conditioners, 
foam boosters, bleaching agents, e.g. perborate, percarbonate, 
5 tetraacetyl ethylene diamine (TAED) , or nonanoyloxybenzene 
sulfonate (NOBS) , anti-corrosion agents, soil-suspending 
agents , sequestering agents , anti— soil redeposition agents , 
stabilizing agents for the enzyme (s) , foam depressors, dyes, 
bactericides, optical brighteners or perf tames. 

10 

Particular forms of detergent composition within the scope of 
the invention include: 

a) A detergent composition formulated as a detergent powder 
15 containing phosphate builder, anionic surfactant, nonionic 

surfactant, silicate, alkali to adjust to desired pH in use, 
and neutral inorganic salt. 

b) A detergent composition formulated as a detergent powder 
20 contajLning zeolite builder, anionic surfactant, nonionic 

surfactant, acrylic or equivalent polymer, silicate, allcali to 
adjust to desired pH in use, and neutral inorganic salt. 

c) A detergent composition formulated as an aqueous detergent 
25 liquid comprising anionic surfactant , nonionic surf actant 

humectant, organic acid, caustic allcali, with a pH in use 
adjusted to a value between 7 and 10.5. 

d) A detergent composition formulated as a nonaqueous deter- 
30 gent liquid comprising a liquid nonionic sxirfactant consisting 

essentially of linear allcoxylated primary alcohol, phosphate 
builder, caustic alkali, with a pH in use adjusted to a value 
between about 7 and 10.5. 

35 e) A detergent composition formuiated as a detergent powder 
in the form of a granulate having a bulk density of at least 
600 g/1/ containing anionic surfactant and nonionic surfactant. 
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low or substantially zero neutral inorganic salt, phosphate 
builder, and sodium silicate. 

f) A detergent composition formulated as a detergent powder 
5 in the form of a granulate having a bulk density of at least 
600 g/1, containing anionic surfactant and nonionic surfactant, 
low or substantially zero neutral inorganic salt, zeolite 
builder, and sodium silicate. 

10 g) A detergent composition formulated as a detergent powder 
containing anionic surfactant, nonionic surfactant, acrylic 
polymer, fatty acid soap, sodium carbonate, sodium sulphate, 
clay particles, and sodium silicate. 

15 h) A liquid compact detergent comprising 5-65% by weight of 
surfactant, 0-50% by weight of builder and 0-30% by weight of 
electrolyte. 

Apart from these ingredients, the detergent compositions a)-h) 
20 include the cellulase of the invention and optionally one or 
more other enzymes, as indicated above. 

The softening, soil removal and colour clarification effects 
obtainable by means of the enzyme of the invention generally 

25 require a concentration of the enzyme in the washing solution 
- of 0.0001 - 100> preferably 0.0005 - 60, and most preferably 
0.01 - 20 mg of enzyme protein per liter. The detergent 
composition of the invention is typically employed in 
concentrations of 0.5 - 20 g/1 in the washing solution. In 

30 general, it is most convenient to add the detergent additive 
in amounts of 0.1 - 5% w/w or, preferably, in amounts of 0.2 - 
2% of the detergent composition. 

BRIEF DESCRIPTION OF THE DRAWINGS 

35 

Fig. 1 is a map of plasmid pYHDlV, wherein "TPI promoter" 
indicates the S. cerevisiae triose phosphate isomerase 
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pr-omoter^ "Tenninatior" indicates the S. cerevisiae triose 
phosphate isomerase terminator, "Amp" indicates the gene 
mediating ampicillin resistance, "2^ ori" indicates the yeast 
plasmid 2/i origin of replication, and "URA3" indicates a gene 
5 encoding a selection marker complementing a uracil deficiency 
in the host strain; and 

Fig. 2 is a map of plasmid pHD414, wherein "AMG Terminator" 
indicates the A. niaer glucoamylase termdLnator, and "TAKA 
10 Promoter" indicates the A. oryzae TAKA amylase promoter; 

The present invention is further illustrated in the following 
examples which are not in any way intended to limit the scope 
of the invention as claimed . 

15 

EXAMPLES 

^Materials and Methods 

20 Donor organism: mRNA was isolated from the following organisms: 
H. insolens . DSM 1800, grown in a cellulose-rich fermentation 
medium with agitation to ensure sufficient aeration. 

Construction of an expression plasmid; The commercially 
25 available plasmid pYES IX (Invitrogen) was cut with Spel, 
filled in with Klenow DNA polymerase + dNTP and cut with Clal. 
The DNA was size fractionated on an agarose gel, and a fragment 
of about 2000 bp was purified by electroelution. The same 
plasmid was cut with Clal/PvuII, and a fragment of about 3400 
30 bp was purified by electroelution. The two fragments were 
ligated to a blunt-ended Sphl/EcoRI fragment containing the 
yeast TPI promoter. This fragment was isolated from a plasinid 
in which the TPI promoter from cerevisiae (cf ^ T. Albers and 
G. Kawasaki, J- Mol. AdpI. Genet, i, 1982, pp. 419-434) was 
35 slightly modified: an internal SphI site was removed by 
deleting the four bp constituting the core of this site. 
■ Furthermore, redundant sequences upstreeim of the promoter Were 
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removed by Ball exonuclease treatment followed by addition of 
a SphI linker. Finally, an EcoRI linker was added at position - 
10 ♦ After these modifications, the promoter is included in a 
Sphl-EcoRX fragment* Its effeciency compared to the original 
5 promoter appears to be unaffected by the modifications. The 
resulting plasmid pYHD17 is shown in Fig. 1. 

Isolation of mRKA: Total RNA was isolated from approximately 
7 g of mycelium. The mycelium was frozen in liquid nitrogen and 
10 ground in a mortar with 1 g of quartz sand to a consistency of 
flour. The RNA was extracted with guanidinium thiocyanate and 
centrifuged through CsCl essentially as described in Sambrook 
et al. / 1989, op. cit. . Poly A RNA was isolated from total RNA 
by chromatrography on oligo dT cellulose. 

15 

cDNA synthesis: cDNA synthesis was carried out by means of a 
cDNA synthesis kit from Invitrogen according to the 
manufacturer's specifications. The DNA was adapted to the 
expression vectors by addition of a Bstxl linker (Invitrogen) 

20 and size fractionated on an agarose gel. Only DNA larger than 
5-600 bp was used in the library construction. The adapted cDNA 
was ligated into an appropriate vector cut with Bstxl. 
Following test ligations (in order to determine the size of the 
librairy) the library was plated onto 50 agar plates. To each 

25 plate containing from approximately 500 to 5000 individual 
clones (dependent T on the library size) was added 3 ml medium. 
The bacteria were scraped off, 1 ml glycerol was added, and 
stored at -80 "C as 50 pools. The remaining 2 ml were used for 
DNA isolation. If the amount of DNA was insufficient to give 

30 the required number of yeast trans foirmants (see below) , large 
scale DNA was prepared from 500ml medium (TB) inoculated with 
50 /il -80 *C bacterial stock propagated over night. 

Construction of Yeast Libraries: DNA from one or more pools was 
35 transformed into yeast as described below. To ensure that all 
the bacterial clones were tested in yeast a number of yeast 
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transformants 5 x larger than -the number of bacteria clones in 
the original pools was set as a limit- 
Trans format ion of yeast: The yeast strain used was yNG231. (MAT 
5 alpha, leu2, ura3-52, his4-539, pep4-delta 1, cir+) . One colony 
was grown at 30' C overnight in 10 ml YPD (this culture can be 
stored for several days at 5°C) . ' 

10, 30, and 60 fil of this culture were added to 3 shei3cer flasks 

10 containing 100 ml YPD, and incubated with shaking overnight at 
30 'C. The culture with an OD^oo closest to 0.3-0.4 was selected. 
The cells were harvested in 50 ml tubes in a Beckman centrifuge 
(speed 6, 10 minutes) , the cells were resuspended in 2 x 5 ml 
HjO, centrifuged as described above, resuspended in 5 ml buffer 

15 containing 0.1 M LiAc, 10 mM Tris-Cl, 1 mM EDTA, pH 7-5, and 
centrifuged again. The cells were resuspended in 500 ;il of the 
above buffer and incubated for 60 minutes at 30*C. 250 fig 
carrier DNA (sterile salmon-sperm DNA 10 mg/ml) was added and 
aliquots of 100 nl were prepared. The DNA to be transformed 

20 (approx. 5 fig) was added to the 100 /xl aliquot, mixed gently, 
and incubated for 30 minutes at 30"C. 700^1 40% PEG 4000, 0.1 
M LiAc, 10 mM Tris-Cl, 1 mM EDTA, pH 7.5 was added, and 
incubation was continued for 60 minutes at 30 °C. The 
transformation mixture was subjected to heat shock for 5 

25 minutes at 42 *C, spun briefly in a micro centrifuge, 
resuspended in 100-200 fxl Hp, and plated on SC plates without 
Tiracil, followed by incubation for three days at 3 0*C. 

Preparation of carrier DNA: 100 mg salmon-sperm DNA was weighed 
30 out and dissolved overnight in 10 ml 10 mM Tris-Cl, 1 inM EDTA, 
pH 7,5 (TE) . The solution was then sonicated in a plastic 
container in ice water until it was no longer viscous. The 
solution was then phenole extracted and EtOH precipitated, and 
the pellet was washed and resuspended in 5 ml TE. The 
35 suspension was EtOH precipitated, and the pellet was washed and 
resuspend in 5 ml TE. The OD260 was measured, and the suspension 
was diluted with TE to 10 mg/ml. 
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Media: 

yPD: 10 g yeast extract, 20 g peptone, HgO to 810 ml. 
Autoclaved/ 90 ml 20% glucose (sterile filtered) added. 

5 10 X Basal salt: 66.8 g yeast nitrogen base, 100 g succinic 
acid, 60 g NaOH, H^O ad 1000 ml, sterile filtered- 

SC-URA: 90 ml 10 x Basal salt, 22,5 ml 20 % casamino acids, 
9 ml 1% tryptophane, ad 806 ml, autoclaved, 3.6 ml 5% 

10 threonine and 90 ml 2 0% glucose added. 

SC-H agar: 7.5 g/1 yeast nitrogen base without amino acids, 
11.3 g/1 succinic acid, 6.8 g/1 NaOH, 5.6 g/1 casamino acids 
without vitamins, 0.1 g/1 tryptophan and 20 g/1 agar (Bacto) . 
15 Autoclaved for 20 min. at 121'*C. After autoclaving, 55 ml of 
a 22% galactose solution and 1.8 ml of a 5% threonine solution 
were added per 450 ml agar. 

SC-H broth: 7.5 g/1 yeast nitrogen base without amino acids, 
20 11.3 g/1 succinic acid, 6.8 g/1 NaOH, 5.6 g/1 casamino acids 
without vitamins, 0.1 g/1 tryptophan. Autoclaved for 20 min. 
at 121'C. After autoclaving, 10 ml of a 3 0% galactose solution, 
5 ml of a 3 0% glucose solution and 0.4 ml of a 5% threonine 
solution were added per 100 ml medium. 

25 

YNB-1 agar: 3*3 g/1 KHjPO^, 16.7 g/1 agar, pH adjusted to 7. 
Autoclaved for 20 min. at 121**C. After autoclaving, 25 ml of 
a 13.6% yeast nitrogen base without amino acids, 25 ml of a 4 0% 
glucose solution, 1.5 ml of a 1% L-leucine solution and 1.5 ml 
30 of a 1% histidine solution were added per 450 ml agar. 

YNB-1 broth: Composition as YNB-l agar, but without the agar. 

CMC overlayer gel: 1% agarose, 1% carboxymethyl cellulose in 
35 Tris-malate buffer, pH 7. The gel was boiled and then cooled 
to 55*'C before the overlayer was poured onto agar plates. 
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Oat spelt xylcin overlayer gel: 1% agarose, 1% oat spelt xylan 
(Sigma Cliemical Company) in Tris-malate buffer, pH 7. The gel 
was boiled and then cooled to 55 "C before the overlayer is 
poxired onto agar plates - 

5 

construction of an Aspergillus expression vector: The vector 
pHD414 is a derivative of the plasmid p775 (described in EP 238 
023) . In contrast to this plasmid, pHD 414 has a string of 
unique restriction sites between the promoter and the 

10 teinainator. The plasmid was constructed by removal of an 
'approximately 200 bp long fragment (containing undesirable RE 
sites) at the 3* end of the terminator, and subsequent removal 
of an approximately 250 bp long fragment at the 5 Vend of the 
promoter, also containing undesirable sites. The 200 bp region 

15 was removed by cleavage with Narl (positioned in the pUC 
vector) and Xbal (just 3 • to the terminator) , subsequent 
filling in the generated ends with Klenow DNA polymerase +dNTP, 
purification of the vector fragment on gel and religation of 
the vector fragment- This plasmid was called pHD413. pHD413 was 

20 cut with StuI (positioned in the 5 'end of the promoter) and 
PvuII (in the pUC vector) , fractionated on gel and religated. 
The plasmid pHD 414 is shown in Fig. 2. 

E:gample 1 

25 

A library fr-o m H- insolens consisting of approx. 300,000 
individual clones in 50 pools was constructed. 

DNA was isolated from 20 individual clones from the library and 
30 subjected to analysis for cDNA insertion. The insertion 
frequency was found to be >90 % and the average insert size was 
approximately 1400bp. 

DNA was isolated from 10 pools from the Humicola library (2ml 
35 from the original plate) . An aliquot was digested with 
restriction enzymes in order to excise the cDNA insert eoid 
suialyzed by Southeim blot using a 43KD cellxilase cDNA probe 
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(the 43 kD enzyme is disclosed in PCT/DK91/0012 3 ) and a CBH 2 
cDNA probe (the enzyme is disclosed in PCT/DK91/00124 ) . Several 
bands were found to hybridize with the 4 3kD cellulase probe 
after a low stringency wash ( 2x SSC 65 *C) in the 10 pools from 
5 the Humicola library. At higher stringency (0.1 x SSC, 75 °c ) 
one band corresponding to the expected size for 4 3kD cellulase 
was detected in 5 out of 10 pools. Similar results were 
obtained with the CBH 2 probe. Here 10 out of 10 pools were 
found to have a band corresponding to the expected size for CBH 
10 2. In addition, 4 pools contained bands with a higher molecular 
weight. These bands were seen even under stringent conditions, 
demonstrating that the library is of an appropriately high 
quality. 

15 DNA from the Humicola library, pools 1-10, was transformed into 
yeast, and plates containing 20-25,000 colonies were obtained 
from each pool. The colonies were scraped off and stored in 
glycerol at -SCC. 

Yeast cells from the library were spread onto YNB agar to a 
total of about 400,000 colonies. The number of colonies per 
plate varied from 50 to 500. After 4 or 5 days of growth, the 
agar plates were replica plated onto two sets of SC-H agar 
plates. These plates were then incubated for 2-4 days at 30 •C 
before the two sets of agar plates were overlayered with a CMC 
indicator gel for detection of cellulase activity and oat spelt 
xylan indicator gel for the detection of xylanase and 
cellulase. After incubation overnight at 40 "C, enzyme reactions 
were visualised with Congo Red. 10-15 ml of a 0.1% solution of 
Congo Red was poured onto the over layer and removed after 10-20 
min. The plates were then washed once or twice by pouring 10-15 
ml of 2M NaCl onto the plates. The NaCl solution was removed 
after 15-25 min. Cellulase-positive colonies were identified 
on the plates with the CMC overlayer as colonies with 
colourless or pale red clearing zones on a red background. 
Xylanase-positive colonies identified on the plates with oat 
spelt xylan overlayers as colourless or pale red clearing zones 



20 



25 



30 



35 
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on a red background* Cellulase-positive colonies were also 
identified on plates with oat spelt xylan overlayers as pale 
red or blue clearing zones on a red background. 

5 Cells from enzyme-positive colonies were spread for single 
colony isolation on agar, and an enzyme-producing single colony 
was selected for each of the celltilase- or xylanase-producing 
colonies identified . 



10 Each of the 133 cellulase-producing colonies and 147 of the 
xylanase-producing colonies were isolated. Some of these 
colonies were inoculated into 20 ml YNB-1 broth in a 50 ml 
glass test tube. The tube was shaken for 2 days at 3 0" C. The 
cells were harvested by centrifugation for 10 min. at 3000 rpm. 

15 

The cells were resuspended in 1 ml 0.9 M sorbitol, 0.1 M EDTA, 
pH 7.5. The pellet was transferred to an Eppendorf tube, and 
spun for 30 seconds at full speed. The cells were resuspended 
in 0.4 ml 0.9 M sorbitol, O.l M EDTA, 14 mM ^-mercaptoethanol. 
20 100 ^tl 2 mg/ml Zymolase was added, and the suspension was 
incubated at 37 °C for 30 minutes and spun for 30 seconds. The 
pellet (spheroplasts) was resuspended in 0.4 ml TE. 90 fJL± of 
(1.5 ml 0.5 M EDTA pH 8 • 0, 0. 6 ml 2 M Tris-Cl pH 8.0, 0.^ ml 
10% SDS) was added, and the suspension was incubated at 65 *C 
25 for 30 minutes. 80 /il 5 M KOAc was added, and the suspension 
was incubated on ice for at least 60 minutes and spun for 15 
minutes at full speed. The supernatant was transferred to a 
fresh tube which was filled with EtOH (room temp.) followed by 
thorough but gentle mixing and spinning for 30 seconds. The 
30 pellet was washed with cold 70% ETOH, spun for 30 seconds and 
dried at room temperature. The pellet was resuspended in 50 /il 
TE and spun for 15 minutes. The supernatant was transferred to 
a fresh tube. 2.5 /il 10 mg/ml HNase was added, followed by 
incubation at 37 for 30 minutes and addition of 500 /xl 
35 isopropanol with gentle mixing. The mixture was spun for 30 
seconds, and the supeamatant was removed. The pellet was rinsed 
with cold 96% EtOH and dried at room temperature. The DNA was 
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dissolved in 50 iil water to a final concentration of 
approximately 100 /il/ml . 

The DNA was transformed into E>coli. by standard procedures- 
5 Two E_i. coli colonies were isolated from each of the 
transformations and analysed with the restriction enzymes 
Hindlll and Xbal which excised the DNA insert. DNA from one of 
these clones was retransf ormed into S. cerevisiae strain JG169 
(MATa; ura 3-52; leu 2-3, 112; his 3-D200; pep 4-113; 
10 prcl::HIS3; prbl:: IiEU2) and rescreened for enzyme activity. 

The DNA sequences of several of the positive clones were 
partially determined. The partial DNA sequences are shown in 
Sequence Listings SEQ ID#7-15. Based on the DNA sequence, the 
15 clones were classified as follows: 

Endocellulases : 

CMC 1: C3, 26, 27, Xy33, XY46 250 amino acids (SEQ ID#7) 

20 CMC 4: C46, 47, 50, 51, 54, 



25 



CMC 5: 



CMC 38K: 



CMC 6: 



C13 



C49 



Xy49 



101, 102, 103, 104 



"1400 bp (the enzyme of the 
invention) (SEQ ID#8) 
"1050 bp (SEQ ID#9) 
"1000 bp (SEQ ID#10) 
(SEQ ID#11) 



-CMC EGlr C6, 11, 15, 16, 17, 21, 

22, 23, 25, Xy34, 41, 145 



(SEQ ID#12) 



Xylanases : 



30 



35 



XYL 3: 



XYL 1: 



XYL 2: 



XY30, 31, 40, 42, 101, 102, 
110, 117, 119, 123, 125, 

136, XY56, 60, 137 22 JcD (SEQ ID#13) 

XY103, 104, 107, 108, 109, 

113, 114, 118, 120, 121, 124, 

126, 128, 130, 134, 142, 143 (SEQ ID#14) 

XY115, 116, 132, 146 (SEQ ID#15) 
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In order to express the genes in Aspergillus the cDNA insert 
is isolated from one or more representatives of each family and 
cloned into the vector pHD414 which is transformed into A-. 
5 orvzae or A^. niaer according to the general procedure described 
below. 

Transformation of Aspergillus orvzae or Aspergillus niaer 
( general procedure ) 

100 ml of YPD (Sherman et al.. Methods in Yeast Genetics, Cold 
Spring Harbor Laboratory, 1981) is inoculated with spores of 
A, orvzae or A. niger and incubated with shaking at 37*^C for 
about 2 days. The mycelium is harvested by filtration through 

15 miracloth and washed with 2 00 ml of 0.6 M MgSO^. The mycelium 
is suspended in 15 ml of 1.2 M MgS04. 10 mM NaH2P04, pH = 5.8. 
The suspension is cooled on ice and 1 ml of buffer containing 
120 mg of Novozym** 234, batch 1687 is added. After 5 minutes 
1 ml of 12 mg/ml BSA (Sigma type H25) is added and incubation 

20 with gentle agitation continued for 1.5-2.5 hours at 37 "G until 
a large number of protoplasts is visible in a sample inspected 
tinder the microscope. 

The suspension is filtered through miracloth, the filtrate 
25 transferred to a sterile tube and overlayered with 5 ml of 0.6 
M sorbitol , 100 mM Tris-HCl , pH = 7.0. Centrifugation is 
performed for 15 minutes at 100 g and the protoplasts are 
collected from the top of the MgS04 cushion. 2 voltimes of STC 
(1.2 M sorbitol, 10 mM Tris-HCl, pH = 7.5. 10 mM CaCl2) are 
30 added to the protoplast suspension and the mixture is 
centrifugated for 5 minutes at 1000 g. The protoplast pellet 
is resuspended in 3 ml of STC and repelleted. This is repeated. 
Finally the protoplasts are resuspended in 0.2-1 ml of STC. 

35 100 /tl of protoplast suspension is mixed with 5-25 ftg of the 
appropriate DNA in 10 fil of STC. Protoplasts are mixed with 
p3SR2 (an A. nidulans amdS gene carrying plasmid) . The mixture 
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is left at room temperature for 25 minutes. 0,2 ml of 60% PEG 
4000 (BDH 29576) • 10 mM CaCl2 and 10 mM Tris-HCl, pH == 7.5 is 
added and carefully mixed (twice) and finally 0.85 ml of the 
same solution is added and carefully mixed. The mixture is left 
5 at room temperature for 25 minutes, spun at 2500 g for 15 
minutes and the pellet is resuspended in 2 ml of 1.2 M 
sorbitol. After one more sedimentation the protoplasts are 
spread on the appropriate plates. Protoplasts are spread on 
minimal plates (Cove Biochem. Biophys. Acta 113 (1966) 51-56) 

10 containing 1.0 M sucrose, pH = 7.0, 10 mM acetamide as nitrogen 
source and 20 mM CsCl to inhibit background growth. After 
incubation for 4-7 days at 37**C spores are picked and spread 
for single colonies. This procedure is repeated and spores of 
a single colony after the second reisolation is stored as a 

15 defined transf ormant . 

Example 2 

Cellulase i:ype 4 clones C46 and C51 and a 43 kD cellulase 
20 control clone (obtained by transforming yeast strain JG169 with 
pYHD17 carrying a DNA sequence coding for the 4 3 kD cellulase 
[isolated as described in PCT/DK9 1/000123 ] ) were inoculated in 
100 ml test tubes with 15 ml YNB-1 broth. The tubes were 
agitated at 30 °C for 2 days. 5 ml of broth from each tube were 
25 then used as seed material for shake flasks containing 100 ml 
SC-H broth. The shake flasks were agitated for 4 days at 30*C. 
The cells from 20 ml of broth were collected by centrif ugation 
and mixed with 1-2 ml 0.1 M sodium phosphate buffer, pH 7, and 
3.3 g of glass beads (420-500 fnn in diameter) in 10 ml glass 
30 test tubes. The crude cell extracts were collected after about 
8 minutes of agitation by means of a IKA vibrax VXR (available 
from IKA Labortechnik) . 

The cellulase activity of the crude cell extracts from the 
35 yeast clones C46, C51 and 43 kD were measured under different 
conditions by the size of the clearing zones formed in CMC 
containing gels. 
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CMC crel: CMC overlayer gel as described above. 



CMC IAS gel ; 2% agarose, 1% CMC in 50 mM sodiiaia phosphate 
buffer, pH 7, boiled and mixed with one volume of 0,12% IAS. 

5 

The cellulase activity was measured by adding 15 ^1 crude cell 
extract to 4 mm (diameter) holes' in the gel. The crude cell 
extracts were diluted with one voliame of 6.12% IAS before 
addition to the CMC IAS gel and with one volume of water before 
10 addition to the CMC gel . The clearing zones were then 
visualised after 18 hours of incubation at 40'C by staining 
with Congo Red as described above. 



25 





The results are 


shown in the 


following 


table. 


15 












Clone 


C4 6 


C51 


43 kD 


20 


CMC 


14 


14 


17 




CMC IAS 


15 


13 


0 



Activities are shown as mm clearing zones 



It appears from the table that the enzyme produced by C46/C51 
is IAS resistant. 
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SEQUENCE LISTING 



5 



(1) GENERAL INFORMATION; 



(i) APPLICANT: 

(A) NAME: Novo Nordisk A/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 
10 (E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2880 

(G) TELEPHONE: +45 4444 8888 

(H) TELEFAX: +45 4449 3256 

(I) TELEX: 37304 



15 



20 



30 



(ii) TITLE OF INVENTION: A Method of Cloning Proteins in 
Yeast 

(iii) NUMBER OF SEQUENCES: 15 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

25 (D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Humicola insolens 

(xi) SBQCJENCE nESCKIPEIC3N: SEQ ID NO: 1: 
45 TGGCAQGAGr GTOGTGGCGr TSGCITCKX; GGCTCIAOKr CCJXSIGTGIC OGQTEACAOG 60 
TQOGaPGEACT TGAACGACTG CTACAfiOCAA TGC 93 

(2) iNrooRManoN for sbq id no: 2: 

50 

(i) SBCPENCE CHZ^RACEERISTICS: 

(A) lENGIH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) SIRANDE32IESS: single 
55 (D) TOPDIOGY: linear 
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(ii) MDIECCIE T5fEE: cW?^ 



5 



(vi) CJEilGINAL SCORCE: 

(A) QRcaNISM: Humioola insolens 



10 



(xi) SBCPENCE DESCRIPriON: SBQ ID NO: 2: 
CftGOGCSiGOC GAOGAOSnA OQGAGAAGAC AAC3A0GACC AQQQCAACaT OSlCZACAaG 60 

GaxaGcax3s GCKKxacrr caaocactcc gg 92 

(2) INEORMftlErON FOR SBQ ID NO: 3: 

15 (1) SBQOTNCE CHARACEEECCSTICS: 

(A) IfNGIH: 132 base pairs 

(B) TXEE: nucleic acid 

(C) STBMlDELtiESSz single 

(D) TDPOIOG^: linear 



20 



(ii) MDIBCDIE TXFE: cENA 



25 



(vi) ORIGIIB^ SODECE: 

(A) C3RGANISM: Humicola insolens 



30 



(xi) SOEGEIENCE DESCRIFriON: SBQ ID NO: 3: 
CXaAGQOGAA CTTCAACTGG TTQGCftTCAA (X^lSIXXTCC GCTGAGTTCG GCAAGGftGAG 60 
TftTOOQGCEA TOGGCAftGCA CIIACrrCCr TOGOGAOCTC GIDGMTCAA GOSCACftTCA 120 
AaXXaDGGCTT CA 132 
35 (2) INFDKMMniCN FDR SBQ ID NO: 4: 

(i) SBQCIENCB CHARACrEEOSTICS: 

(A) lENSIH: 69 base pairs 

(B) TXPE: nucleic acid 
40 (C) SHIANDEZXIESS: single 

(D) TOPOIDG3f: linear 

(ii) MDIECOrE TXEE: cCNA 

45 (vi) QEOiGINAL SODKCE: 

(A) GRGiOIISM: Humicola insolens 



50 



(xi) SBQDENCE DESCKEPEICN: SEQ ID NO: 4: 

c!raoGaxaA ogtgaccaac aacaacedog coGEAsasac ccaGaAcaaG cdsdseaccs^ 60 
GaTQca!rcA 69 
55 (2) iNPosraarrcN for seq id no: 5: 
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(i) SBQCJENCE CHARACrEEOISTICS : 

(A) lENGTIH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 
5 (D) TOPOIDGY: linear 

(ii) jyOIECDLE TYPE: CENA 

(vi) QECTGINAL SCXJPCEt 
10 (A) CSE^GANISM: Humicola insoleris 

(xi) SBQCJENCE DESCRIPnCN: SEQ ID NO: 5: 
15 QGAOQGTOCXS GCAOSAGGAC CGCC1X2CGTC AGCACXDCAGG TCXSGCCTTCA G0GC3CTCAIT 60 
QGC3G0GACCA AGTOGCTGAG GC3^AAACX3GC AAGGTIGGAC TCCTCXXXSAC TIG0C3G0GGC 120 

20 (2) INFDRMATrON FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARAGTEEO^TICS: 

(A) XENGTIH: 120 base pairs 

(B) TIPE: nucleic acid 
25 (C) SERANDEDNESS: single 

(D) TOPOI£)GY: linear 

(ii) MOIECCJIE TOTE: cDNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Humicola insolens 



35 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCCAAGroOG TTTGCCAGCA GGCGATTGAG GGCAIGCIGA ACX3^OCT0CA GGAGAA3AGC 60 
GA3X3ICroGA CAGGiraX^ CIGG?]X3QGOG GGAGGOOOGT GGTrSGGGrnG ACEATATCEA 120 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACnSRISTICS: 

(A) lENGOH: 1027 base pairs 
45 (B) TORE: nucleic acid 

(C) STRANDEENE5S: single 

(D) TOPOIDGY: linear 



(ii) MDIECUIE TOTE: cENA 

(vi) GRIGINMj SOURCE: 

(A) ORGANISM: Humicola insolens 



55 (xi) SEQUENCE DESCRIFTICN: SEQ ID NO: 7: 
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CSaXXXCrm OSVCCACCAQC TCTACIGCAG ACXJITCTCCA ATTTCTOQGA TCAOOGOCAT 60 

GCTCAAGTCT GCTCICCrOC TOSQXXXISC GGOOCTITaS GrTCAGTrOGG CCTOGaiCOC 120 

5 CS^CO^TOOOG GCaAAaHTC AGCXTOGOCA GMTOGCEaG CTCTSDGaGC TSmOGGCIA 180 

CIGGTODQGC AMGOnaiG AG^ 240 

OSGCTXSGCaA TQCACCIkCC TOCaOGGCAC CAACaAOQGT QGCaTTGACT GGAGC&COGC 300 

10 

GEQQGAGTGG CaCQGCXSCTC OSS^CAACCT CAAGAGCEftC CCCrait?riG GCaaOCaGaT 360 

CCaGOGOGGC OSCaACaTCA GC3GACMGAA CAGC3ffi3C3GC AOGTOQCTGT CJCTGGAOCTA 420 

15 asarootacc gacatcogig ccs^atgtogc TEMxaaxsrc ttcaosgcic GrrGaTOooGA 48a 

(XATOOCaAC TOQGGOGQOG AGTAOaGCP OmSATCIGG CTOGCODQCT ATOGCXSGCftT 540 

ciaococaTC GGCAO^rrcx: acs^gccaggt cs^ccrrGcr gctogiaoct GOGarorcro 6oo 

20 

GACTCGCIAC AAOQGCAACA TCOCTCTCIA CAGCTTOCTC CCXXX3CTCX3G GC3GAGa37EOG 660 

TGACITCaGC TGOCaCATCA AQGACITCIT CAACaACCrP GAGOGC2^CX: ADGGCIAC3C3C 720 

25 GGCAaSGGAG GAGAATCTGA TaSTCIAOCA ASEDQGAACX: GAGTCCTICA aSGGCGGrTCC 780 

GGCXS^QGTTC AOGTCCaGGG ACTTGAGQGC TGZ^CCTCTGG lAflGAiGGGGT CaaTOGAGTA 840 

GGGIXnACIT GOCAGGAAGC GGAGAGGAAC AAGSIAGaaA TAMX3ACAGA CAASTCaiTG 900 

30 

GftTCCaGIAA ACAOGCTTOG IXXTTCMDOG GCAAGAdTC CS^ASTGOCEA GACAACAGEG 960 

CTCAGGGGAC ACAGGGAOGA AGAIGGATOC TCCAAAACTC ACaAaTOGGT AGCAaT0GC3C 1020 

35 TTEAGaG 1027 

(2) INPOBMaCTCN K3R SEQ ID NO: 8: 

(i) SEffJMCE CHARACTERISTECS: 
40 (A) imamz 872 base pairs 

(B) TWEi nudelc add 
(C} SIE^ANEIEEIIESS: single 
(D) TOPOIDGZ: linear 

45 (ii) M3IECDIE TXEE: cENA 



50 



(vi) qriginm:! sc 

(A) QE^GANISM: Humioola insolens 



(xi) SBQDENCE DESCKCPTICN: SBQ ID NO: 8: 
AAAGCXnCAA CACiaTIAOC AaCTTCGACA CTXSICCTIGC OGGTCTCTIX:: GOGACTGGAG 60 
55 OSCrOQCCCk GGGOCTGCAT QGCAGCAGIG TOSDOGCSTT GQCTTCTOGG GCTCiaaSIC 120 
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CTSTGICTCC QGnAGAOGT GOCTSEACIT GAAOGACIGG TACAGOCAAT GCGAGOGGAG 180 

CX^GAOGAOGT TAOQGACAAC ACAAGAAOGC CAGGGCAACA TOGACAACAA GGTCAGOOCX: 240 

5 GGCTGCCACr TCAACCACTC OGGCXS^AGGC GAAGTTCAAG TGGflTOGCAT CAACXZAGnCC ' 300 

TGOGCTGACT TOGGCSVAGGA GAGEATCX3QG CEATGGGCAA GCa^CTTACTr CCTIOGOGAC 360 

CTOGKXSATT CAAGOGGACA TCAATCX3DGG dTCAGAIICT CXNNNNNQGC AACrCTGGAC 420 

10 

GGCTGAOSTG AAC3CTGAOCA ACAACAACIT GGCOGTCAGOG ACX3GAGAACA AGCriCTGTAC 480 

CAGAIGCA!rC AGOCACXnOGA dOGGAOQCT OC3QGCAOGAG CAGGGCXTCC GICMXACCC 540 

15 AGGTOGQCXT TCAGOGOGTC ATTGGOGOGA CC3ACICGCT C203GAAAAC GGCAAGGTIG 600 

GACrOCTOGC GACrrGOC3GC GGOGCXAAGT GGGTITCCXA GCAGGCCATT GAGGGCATGC 660 

TGAACXaCCT CXaGGAGAAT AGCGAIXJICT GGACAGGO^ GCrCIGGTGG GOGGGAGGCC 720 

20 

CXJK^GTGGQG TGACEATATC TAG?iasmG AACCICriOG QJEATIGGCr ACACXHACEA 780 

tAATiaXTT CTOVAGAAZ^ AOCTGOCMA GGTGCEATAA GKCCOICGTC CTTSACTCAAG 840 

25 AGGGTrrGAC AQGAAOSCAG OCCIGAGGCT TT 872 

(2) - INTOIMAaiEON^^ 9: 

(i) SBC2CJENCE CHARACTEKESTICS : 
30 (A) lENGTIH: 368 base pairs 

(B) TOTE: nucdLeic acid 

(C) STRANDECNESS: single 

(D) TOPOIiDG^: linear 

35 (ii) MDIECUIE TOTE: cENA 

(vi) ORECTNAL SOURCE: 

(A) ORGANISM: Humicola insolens 

40 

(xi) SEQUENCE DESCRIPnON: SBQ ID NO: 9: 

GFTGAAGGCC CDGAACAGAG GAOCICACGr CCOGAAAATC TOCAGGGCEA CQGAGEATGT 60 

45 ACAGAATATC CACAACCAAA CAAAACTCAA TTGAOCTCOG ATOCXIAACAT GACAAOOXT 120 

TCS^QGCDGGG TOQGAAOCAA CIADCEAGCC AGOCTCICTC CATTTTOCAT CTCCEAACAC 180 

CAAOOOCCCP OCAAITCICIG CCOCTEAAAT OGGGITCACC CGAATOGATC CAGATOOGCA 240 

50 

AACACCAOCr GC3CAACCATC OSCTGOGOCC GCOCTOGCTG AAGAAOGACT TGGCOGGTCr 300 

OGnTQCOGGA GCAGCCACTC GAOCACTftGC TCGCXDCOOGT GOGAACITGC CGTICTAGIA 360 
55 OCATAGGT 
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(2) INPQEgffianiQN for SEQ id NO: 10: 

(i) SBQQENCE CHARACTEKISTrCS: 

(A) IZtKUHi 720 base pairs 
5 (B) TXEE: nucleic acid — 

(C) SIRaNDKHNESS; single 

(D) TOP0IDG5f: linear 

(ii) ISDIEODLB TXEE: cWh 

10 

(vi) ORIGINaL SOORCE: 

(A) QE^GAHISM: HumicGla insolens 



15 (xi) SBCPENCE DESCEOTTION: SEQ ID NO: 10: 

NaaaGGCSWZC AaGGTGAOGG OGTCAOCrOG GGOGAGIX3GG AGAOGaTCOG GKECAOOGAG 60 

AAOCACTCCT CJGCaOGQXC OGTCAOGCaC GRaOCTOGC OGGCXATCAC GTGCIAGGAG 120 

20 

AAGACXSCXXE GOCAGOGGGC CCNAAGAOGG TGAAOGTCOG GGOOGGOSCG AOOGTCAOCT 180 

TGACXCTCTA CAaSGAOSIG GGCX3VOOa3G GOGCTOCACT TCfEAOCTGGC CAAGGTOCX3G 240 

25 OGGCAaGAOG GCOGOGAOCT TIGAOGQCAA GGGOGOOSDG TCGITCAAGA TTIAGCAfiGA 300 

OSGCXnGQIG CnroOGACAG CEOnTXSAOC TOGOCAO^^ TCGCAAGAAG AGTCTOGTCA 360 

MMJNNNNAC CTGGCCIAC3G TIGGCAAGAA GAASTCTOSP CAftTCXXXXG TCOGTCXAGG 420 

30 

ACXSGOGZiGIA COC33GOC3GOG TOGAGCACAT TGCTTGCACA QGOSCXaGOG TOGGOGGTCG 480 

GCAGCTCrAC ATTrOGTCOG OGCAAMXAA CGO^CAOGGOS GCACOQGCAC GCTCAACCaS 540 

35 GGOCAGCIC3G TCTOGITCCC GQGOSOCIAC AAGCXXAOOG ACOOQGGCAT CCTOCTOCAG 600 

CICIACIX3GC C3GCOGCOGAC GCASIACAEK: AAOCXX39GTC OGGOGCOSGT GAASTSCDGA 660 

GaTTCAGTTC AKSiSERCrC CAA!IX3AAQGT OmSCXSGCGG OGAGQGIAGG TOGftaAGTIT 720 

40 

(2) INFORMAIION FOR SEQ TD NO: 11: 

(i) SEQOENCE CHARACTERISTICS: 
45 (A) LENGIH: 724 base pairs 

(B) TWEz nucleic acid 

(C) STRfiNDEENESS: single 

(D) TOPOIIDGY: linear 

50 (ii) MOIEamS TXEE: cENA 

(vi) ORIGINAL SOQE^CE: 

(A) QBG?^NISM: Humicola insolens 

55 

(xi) SEQPENCE DESCRIPEICN: SEQ ID NO: 11: 
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CXaiAGGOXDGC CX3lCXavrGC3G OGTITCrCTr GCTCrCCTOG CdACCIGCT CAGCGCC^GCC 60 
OOGGCXTOGC COGTCCOSGA GCTCGAGCCC OGGGAGTCOOG GCAACCCCTT CrCXX3GCCX3C 120 
5 ACXXnGdGG TCAACTOQGA C]M3^GCAGC AAGCTOGACC AGAOSOGCSCA GGCCITrCXT 180 
GTOOOGOQQC AOGCIGCGAA GG?rCAAGIAC GTOCAGGAGA AQCTIGGCAC 240 

dTi tr im ' iG GAcrrooAc AicTiccrcc tgogcagcac TGAOGrncoc aicgagaaig 3oo 

10 

CX30GCX3Q(XA AQGCOGOGOG AGAACCCGAT OCTOSGTCTC GnXTCrEACA ACCTCCCOGA 360 

OCXSOSACTGC AGOGAOGCGG GAGTEACXncr GGOGACXnTA AGCTCTCCCA GAAOGGCJCIG 420 

15 AACXDCSGTIACA AGAAOGAGTIA CSGTCAACCXX; TTOGCCXAGA AGCTCAAGGC OGOCaXXXSAC 480 

GrOGACSirOG .CCGICNrCCr OGAGCCOGAT GOCATOGGGA ACATCGTCAC GGGCAOCZAGC 540 

aXTICIGCX: GCAAOGCCOG OQGOCXJTCAG AGGAGGOCAT CXXSCTATCCT ATCTCIOCTC 600 

20 

QGdGGGCXDG ATAAGCTOGA GCGAACTGCC CAGGAGGTCC CACCATCCTC CAAAAGGCOG 660 

GEAACAAOGC AAGATOGOGG CITCTGAGCA AOnTOCAAC TACAACXICAT TCAOGACAAC 720 

25 OGCG 724 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SBC3UENCE CHARACTERISTICS: 
30 (A) IZNG?IH: 71 base pairs 

(B) TifPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOIiDG^r: linear 

35 (ii) MOIECDIE T£PE: cENA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Humicola insolens 

40 

(xi) SBQCJENCE DESCRIPTION: SEQ ID NO: 12: 

CCTOCrrOCA GKTTIGACT TOCTTOGGAC CIGCAGOGrrC CTQAACAACT OGCTCTAGCr 60 

45 CAACAACCAT G 71 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SBQDENCE CHARACnEKISTICS : 
50 (A) IZNGTH: 572 base pairs 

(B) TXEE: nucleic acid 

(C) STOANDECNESS: single 

(D) TOPOIOGY: linear 

55 (ii) miEOJIB T£PE: cDNA 
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(vi) ORIGmaL SCfQRCE: 

(A) QE^GANISM: Humicx>la insolens 



lO 



(xi) SEQjmCE DESCKTPIION: SBQ ID NO: 13: 
QQOOSIOGCC GCTTCrTCAG TIGIGIACGh TCaTOCaGCA ACraGC2U3IT CACCXTOGTC 
TOGCTCAACT CIXnXXTOGC GGCOGOCAOG GCTSTGAGCT CIGCaOTGC TCX300CTTIT 

GAcnosrrc ctogggacsa ctcgacqxc cmaGGcrc GcxaGGixac cocxs^j^ogoc 

CaGGGCTGGC AGAACEGCIA TGSrGGaXXD& AOGQOQGAGG OCAGGITCaG 

15 TACACXS^ACX:: TOGAGGGCAG CX3QCIACCftG GOXaGaiGGC GTAAGACX3QG OACITOGTC 
GGTGGiaAGG GETSGAACXX: GGGAAC3CX3GC OGGAOGKTCA ACrAOGGOQG CTACITCAAC 
CXXXAGQGCA AOQGCIACCT GGCXDGTCEAC QGCTOGACOC GCAACCX3GCP CCTOSACTAC 

20 

EmsEcairoG agixdgiaoqg caogiacaat ocx3Qgcagcx: aggcicrgia caaqqgcaca 

TTCTaaAODG A0GGC3GMXA GIAIGACATC TP33SIEAGCA CXX3CTCACAA CGAGCCXAGC 
25 AIGAOQGCAC CCGCACGTCC AGCTAGIACr GG 



(2) INFDEayaiTON FOR SBQ ID NO: 14: 



30 



35 



40 



(i) SBQDENCE CHaRACEEfOSTICS: 

(A) LEMSm: 173 base pairs 

(B) TXEE: nucleic acid 

(C) SIRZVNDEaJNESS: single 

(D) TOEOIDGY: 1 inear 

(ii) MDLECOIE TOTE: cSDNA 

(vi) OKCCnNAL SOQRCE: 

(A) QRGaNISM: anoicola insolens 



(xi) SEQDENCE DESGEaFTION: SEQ ID NO: 14: 
AftAGaffiGATC GKXJEGCTOC CTAGAAAOCA GTCaCECMT CACAMX30CT TCT3V!E0GCrC 
45 TOSCTCrTCC OXSOGGCrOOG GGOCrTCCTOG OOCAGrOOCA GCTCTQGGOC AGrGOGGaDGG 
GKTOQGCEQG AAOQGCCIAC GACTTGOGTC TOGQGOGCIA CXnGCAOC3A GftT 



50 



55 



(2) INPOiRMZffilCN FOR SBQ ID NO: 15: 

(i) SEQdENCE CHaRACEERrSTrCS: 

(A) LEtKSEH: 214 base pairs 

(B) T£FE: nucleic acid 

(C) SaiRANDECNESS: single 

(D) TOPOIDGZ: linear 



60 
120 
180 
240 
'300 
350 
420 
480 
540 
572 



60 
120 
173 
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(il) MOIBCUIE lYPE: cDNA 



5 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Humioola insolens 



(xi) SBQCJENCE DESOOTTION: SEQ ID NO: 15: 

AAAGOOQCAG CXSftTCAAGAA CXXaAGCACT CICTCAAAAT GGTOGCCTrC TOGTIOCXTrCT 60 

•KXaraGGEDQC IT0CAT0C3CC GCCAOGOGTC GGCC3GG0CXX: GGTGTAGCIG OCOGGCATCC 120 

AOCTGAACAA GOSTCAGAOC TACACOOGAG AGOGCEACOG GGACTCACAA OGGCEACATC 180 

15 TKnXX3TCT GGACTGAOGG TCTOGOOGAA OCTC 214 
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CLAIMS 

5 1. A metJiod of screening foir a DNA sequence coding for a 
protein of interest, the method comprising 

(a) cloning, in suitable vectors, a DNA library from an 
organism suspected of producing one or more proteins of 

10 interest, 

(b) transforming suitable yeast host cells with said, vectors, 

(c) culturing the host cells under suitable conditions to 
15 express any protein of interest encoded by a clone in the DNA 

library, and 

(d) screening for positive clones by determining any activity 
of a protein expressed in step (c) . 

20 

2. A method according to claim 1, wherein the DNA library is 
a cDNA library prepared from the mRNA of an organism suspected 
of producing one or more proteins of interests 

25 3. A method according to claim 1 or 2, wherein positive clones 
isolated in step (d) are stibjected to rescreening, reisolation 
and recloning. 

4. A method according to claim 1 or 2, wherein the organism 
30 suspected of producing one or more proteins of interest is a 

eukaryotic organism. 

5. A method according to claim 4, wherein the eukaryotic 
organism is a fungus. 

35 

6. A method according to claim 4, wherein the eukaryotic 
organism is a plant. 
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7. A method according to any of claiins 1-6, wherein the protein 
of interest is an enzyme. 



8. A method according to claim 1, wherein the yeast host cell 
5 is a strain of Saccharomvces cerevisiae , Schizosaccharomyces 

Dombe , Hansenula . Pichia, Yarrowia lipolvtica or Kluyveromyces 
lactis ^ 

9. A process for producing a protein of interest in a 
10 heterologous host cell, the process comprising transforming a 

suitable heterologous host cell with a DNA sequence coding for 
a protein of interest, which DNA sequence has been isolated by 
the method of claim 1, culturing the transformed cells under 
suitable conditions to express the protein, and recovering the 
15 expressed protein from the culture. 

10. A process according to claim 9, wherein the DNA sequence 
coding for the protein of interest has been isolated by the 
method of claim 3 . 

20 

11. A process according to claim 9 or 10, wherein the protein 
of interest is an enzyme. 

12. A process according to any of claims 9-11, wherein the host 
25 cell is a strain of Aspergillus , for instance a strain of 

Aspergillus orvzae or As pergillus niger . 

13. An enzyme which exhibits cellulase activity, and which has 
the following characteristics 

30 

(a) the DNA sequence encoding the enzyme has been isolated from 
a DNA library of Humicola insolens , 

(b) said DNA sequence comprises at least one of the following 
35 partial sequences 
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15 



(i) 



(ii) 



111) 



(iv) 
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TGGCAGCAGT GTGGTGGCGT 
CCTGTGTGTC CGGTTACACG 
GTACAGCCAA TGC 
(SEQ ID#1) 

CAGCGCAGCC GACGACGTTA 
AGGGCAACAT CGACAACAAG 
CAACCACTCCG G 
(SEQ ID#2) 



CCAAGGCGAA 
GCTGAGTTCG 
CTTACTTCCT 



GTTCAAGTGG 
GCAAGGAGAG 
TCGCGACGTC 



ATCGTGGCTT CA (SEQ ID#3) 



TGGCTTCTCG 
TGCGTGTACT 



CGGACAACAC 
GTCAGCCCCG 



TTGGCATCAA 
TATCCGGCTA 
GTCGATTCAA 



GGCTCTACGT 
TGAACGACTG 



5 



AACAACGACC 
GCTGCCACTT 



CCAGTCCTGC 
TGGGCAAGCA 
GCGCACATCA 



CTGACGTGAA CGTGACCAAC AACAACTTGG CCGTAGCGAC 
CGAGAACAAG CTGTGTACCA GATGCATCA (SEQ ID#4) 



20 



(V) 



GGACGGTCCG GCACGAGCAC GGCCTGCGTC AGCACCCAGG 

TCGGCCTTCA GCGCGTCATT GGCGCGACCA ACTGGCTCAG 

GCAAAACGGC AAGGTTGGAC TGCTCGCGAC TTGCCGCGGC (SEQ 
ID#5) 



25 



(Vi) 



GCCAAGTGGG TTTGCCAGCA GGCCATTGAG GGCATGCTGA 

ACCACCTCCA GGAGAATAGC GATGTCTGGA CAGGTGCGCT 

CTGGTGGGCG GGAGGCCCGT GGTGGGGTTG ACTATATCTA (SEQ 
ID#6) 



30 



(c) the enzyme comprises a cellulose-binding domain, and 

(d) ttie enzyme exhibits endocellxilase activity in the presence 
of linear aU^l benzene sulfonate. 



14. An enzyme according to claim 13 , a crude extract (15 /xl) 
35 of which diluted with one volume of 0.15% linear allcyl benzene 
sulfonate and added to a 2% agarose gel containing 1% 
carboxymethyl cellulose in 50 mM sodium phosphate buffer. 
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pH 7, mixed with one volume of 0.15% linear alkyl sulfonate 
forms a clearing zone in said agarose gel after 18 hours of 
inciibation, which clearing zone is equal to (less 3 nan) the 
clearing zone formed in a similar carboxymethyl cellulose gel 
5 not containing any linear alkyl benzene sulfonate, provided 
that the concentration of enzyme in the extract is such that 
a clearing zone of at least 10 mm is formed in a carboxymethyl 
cellulose gel (with no linear alkyl benzene sulfonate) under 
the conditions specified above. 

10 

15. A detergent additive comprising an enzyme exhibiting 
cellulase activity according to any of claims 13-14, preferably 
in the form of a non-dusting granulate, stabilised liquid or 
protected enzyme. 

15 

16. A detergent additive according to claim 15, which further 
comprises one or more other enzymes such as a protease, 
amylase, lipase or peroxidase. 

20 17. A detergent composition comprising an enzyme exhibiting 
cellulase activity according to any of claims 13-14, 

18. A detergent composition according to claim 17, which 
further comprises a linear alkyl benzene sulfonate surfactant. 

25 

19. A detergent composition according to claim 17 or 18, which 
further comprises one or more other enzymes such as a protease, 
amylase, lipase or peroxidase. 

30 
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Box in TEXT OF THE ABSTRACT (Gontlntiatton of Item 5 of the lint sheet) 



Method of screening for a DNA sequence coding for a protein of interest, 
the method comprising 



(a) cloning, in suitable vectors, a DNA library from an organism suspected 
of producing one or more proteins of interest, 

(b) transforming suitable yeast host cells with said vectors, 

(c) culturing the host cells under suitable conditions to express any 
protein of interest encoded by a clone in the DNA library, and 

(d) screening for positive clones by determining any activity of a protein 
expressed in step (c) . 

An enzyme which exhibits cellulase activity and has been isolated from DNA 
library of Humicola insolens. The enzyme has a cellulosebinding domain and 
exhibits endocellulase activity in the presence of linear alkyl benzene 
sulfonate. 
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